Overview
Brought to you by YData
Dataset statistics
| Number of variables | 10 |
|---|---|
| Number of observations | 5000 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 657.8 KiB |
| Average record size in memory | 134.7 B |
Variable types
| Numeric | 9 |
|---|---|
| Categorical | 1 |
PM10 is highly overall correlated with PM2.5 | High correlation |
PM2.5 is highly overall correlated with PM10 | High correlation |
SO2 has 169 (3.4%) zeros | Zeros |
Proximity_to_Industrial_Areas has 60 (1.2%) zeros | Zeros |
Reproduction
| Analysis started | 2024-12-02 21:35:18.978243 |
|---|---|
| Analysis finished | 2024-12-02 21:35:25.192385 |
| Duration | 6.21 seconds |
| Software version | ydata-profiling vv4.12.0 |
| Download configuration | config.json |
Variables
Temperature
Real number (ℝ)
| Distinct | 331 |
|---|---|
| Distinct (%) | 6.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25.46458 |
| Minimum | 3.5 |
|---|---|
| Maximum | 46.2 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 3.5 |
|---|---|
| 5-th percentile | 16.9 |
| Q1 | 21.8 |
| median | 25.3 |
| Q3 | 28.9 |
| 95-th percentile | 34.7 |
| Maximum | 46.2 |
| Range | 42.7 |
| Interquartile range (IQR) | 7.1 |
Descriptive statistics
| Standard deviation | 5.486219 |
|---|---|
| Coefficient of variation (CV) | 0.2154451 |
| Kurtosis | 0.60280894 |
| Mean | 25.46458 |
| Median Absolute Deviation (MAD) | 3.5 |
| Skewness | 0.23391162 |
| Sum | 127322.9 |
| Variance | 30.098599 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 25.8 | 50 | 1.0% |
| 26.7 | 48 | 1.0% |
| 28 | 47 | 0.9% |
| 22.7 | 47 | 0.9% |
| 24.9 | 47 | 0.9% |
| 25.4 | 47 | 0.9% |
| 26.4 | 46 | 0.9% |
| 24.8 | 46 | 0.9% |
| 23.1 | 46 | 0.9% |
| 25.5 | 46 | 0.9% |
| Other values (321) | 4530 |
| Value | Count | Frequency (%) |
| 3.5 | 1 | |
| 5.2 | 1 | |
| 5.3 | 1 | |
| 6.1 | 2 | |
| 7.5 | 1 | |
| 8.3 | 1 | |
| 8.5 | 1 | |
| 8.6 | 2 | |
| 9.2 | 1 | |
| 9.7 | 1 |
| Value | Count | Frequency (%) |
| 46.2 | 1 | < 0.1% |
| 45.9 | 1 | < 0.1% |
| 45.8 | 2 | |
| 45.4 | 3 | |
| 45.2 | 1 | < 0.1% |
| 44.9 | 1 | < 0.1% |
| 44.7 | 2 | |
| 44.6 | 1 | < 0.1% |
| 44.2 | 2 | |
| 44.1 | 1 | < 0.1% |
Humidity
Real number (ℝ)
| Distinct | 724 |
|---|---|
| Distinct (%) | 14.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 60.06814 |
| Minimum | 10 |
|---|---|
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 34.9 |
| Q1 | 49.9 |
| median | 60.2 |
| Q3 | 70.1 |
| 95-th percentile | 84.9 |
| Maximum | 100 |
| Range | 90 |
| Interquartile range (IQR) | 20.2 |
Descriptive statistics
| Standard deviation | 15.044806 |
|---|---|
| Coefficient of variation (CV) | 0.25046233 |
| Kurtosis | -0.090625585 |
| Mean | 60.06814 |
| Median Absolute Deviation (MAD) | 10.1 |
| Skewness | -0.039819814 |
| Sum | 300340.7 |
| Variance | 226.3462 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 58.3 | 23 | 0.5% |
| 61 | 20 | 0.4% |
| 57.3 | 20 | 0.4% |
| 60.2 | 20 | 0.4% |
| 61.6 | 20 | 0.4% |
| 64.6 | 20 | 0.4% |
| 100 | 19 | 0.4% |
| 62.4 | 19 | 0.4% |
| 66.2 | 19 | 0.4% |
| 63.4 | 19 | 0.4% |
| Other values (714) | 4801 |
| Value | Count | Frequency (%) |
| 10 | 4 | |
| 13 | 1 | < 0.1% |
| 14.4 | 1 | < 0.1% |
| 15.4 | 1 | < 0.1% |
| 15.5 | 2 | |
| 16 | 1 | < 0.1% |
| 16.4 | 1 | < 0.1% |
| 17.4 | 1 | < 0.1% |
| 17.6 | 1 | < 0.1% |
| 17.9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 100 | 19 | |
| 99.8 | 2 | < 0.1% |
| 99.7 | 1 | < 0.1% |
| 99.4 | 1 | < 0.1% |
| 99.3 | 1 | < 0.1% |
| 99.2 | 2 | < 0.1% |
| 98.9 | 1 | < 0.1% |
| 98.6 | 1 | < 0.1% |
| 98.4 | 2 | < 0.1% |
| 97.9 | 2 | < 0.1% |
PM2.5
Real number (ℝ)
High correlation 
| Distinct | 1008 |
|---|---|
| Distinct (%) | 20.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29.90558 |
| Minimum | 0 |
|---|---|
| Maximum | 249 |
| Zeros | 9 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1.5 |
| Q1 | 8.5 |
| median | 20.6 |
| Q3 | 41.5 |
| 95-th percentile | 91.205 |
| Maximum | 249 |
| Range | 249 |
| Interquartile range (IQR) | 33 |
Descriptive statistics
| Standard deviation | 30.285899 |
|---|---|
| Coefficient of variation (CV) | 1.0127173 |
| Kurtosis | 5.8785757 |
| Mean | 29.90558 |
| Median Absolute Deviation (MAD) | 14.3 |
| Skewness | 2.0250291 |
| Sum | 149527.9 |
| Variance | 917.23568 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.5 | 23 | 0.5% |
| 2 | 21 | 0.4% |
| 12.7 | 21 | 0.4% |
| 8.4 | 20 | 0.4% |
| 3.5 | 20 | 0.4% |
| 2.6 | 20 | 0.4% |
| 5.3 | 20 | 0.4% |
| 1.3 | 20 | 0.4% |
| 7.6 | 20 | 0.4% |
| 1.9 | 20 | 0.4% |
| Other values (998) | 4795 |
| Value | Count | Frequency (%) |
| 0 | 9 | |
| 0.1 | 12 | |
| 0.2 | 15 | |
| 0.3 | 13 | |
| 0.4 | 19 | |
| 0.5 | 17 | |
| 0.6 | 19 | |
| 0.7 | 16 | |
| 0.8 | 19 | |
| 0.9 | 12 |
| Value | Count | Frequency (%) |
| 249 | 1 | |
| 241.6 | 1 | |
| 223 | 1 | |
| 214.4 | 1 | |
| 214.3 | 1 | |
| 213.7 | 1 | |
| 212.8 | 1 | |
| 209.2 | 1 | |
| 204.3 | 1 | |
| 199.6 | 1 |
PM10
Real number (ℝ)
High correlation 
| Distinct | 1098 |
|---|---|
| Distinct (%) | 22.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.0037 |
| Minimum | -1.4 |
|---|---|
| Maximum | 256.1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 3 |
| Negative (%) | 0.1% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | -1.4 |
|---|---|
| 5-th percentile | 9.195 |
| Q1 | 18.9 |
| median | 31.1 |
| Q3 | 51.5 |
| 95-th percentile | 103.4 |
| Maximum | 256.1 |
| Range | 257.5 |
| Interquartile range (IQR) | 32.6 |
Descriptive statistics
| Standard deviation | 30.693124 |
|---|---|
| Coefficient of variation (CV) | 0.76725712 |
| Kurtosis | 5.4487277 |
| Mean | 40.0037 |
| Median Absolute Deviation (MAD) | 14.4 |
| Skewness | 1.9381335 |
| Sum | 200018.5 |
| Variance | 942.06785 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 17.4 | 21 | 0.4% |
| 14.3 | 21 | 0.4% |
| 22 | 20 | 0.4% |
| 21.7 | 20 | 0.4% |
| 18.2 | 18 | 0.4% |
| 17.8 | 18 | 0.4% |
| 21.3 | 18 | 0.4% |
| 18.3 | 18 | 0.4% |
| 21.1 | 18 | 0.4% |
| 14.5 | 16 | 0.3% |
| Other values (1088) | 4812 |
| Value | Count | Frequency (%) |
| -1.4 | 1 | < 0.1% |
| -1 | 1 | < 0.1% |
| -0.7 | 1 | < 0.1% |
| 0.5 | 1 | < 0.1% |
| 0.6 | 1 | < 0.1% |
| 0.7 | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
| 1.5 | 1 | < 0.1% |
| 1.7 | 1 | < 0.1% |
| 1.8 | 3 |
| Value | Count | Frequency (%) |
| 256.1 | 1 | |
| 239 | 1 | |
| 234 | 1 | |
| 228.5 | 1 | |
| 223.8 | 1 | |
| 223.2 | 1 | |
| 222.2 | 1 | |
| 219.3 | 1 | |
| 217 | 1 | |
| 214.6 | 1 |
NO2
Real number (ℝ)
| Distinct | 598 |
|---|---|
| Distinct (%) | 12.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 21.00036 |
| Minimum | -13.5 |
|---|---|
| Maximum | 96.4 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 102 |
| Negative (%) | 2.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | -13.5 |
|---|---|
| 5-th percentile | 3.9 |
| Q1 | 13.8 |
| median | 20.5 |
| Q3 | 27.5 |
| 95-th percentile | 38.805 |
| Maximum | 96.4 |
| Range | 109.9 |
| Interquartile range (IQR) | 13.7 |
Descriptive statistics
| Standard deviation | 11.30099 |
|---|---|
| Coefficient of variation (CV) | 0.53813315 |
| Kurtosis | 2.5606544 |
| Mean | 21.00036 |
| Median Absolute Deviation (MAD) | 6.8 |
| Skewness | 0.7506528 |
| Sum | 105001.8 |
| Variance | 127.71237 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 22.4 | 30 | 0.6% |
| 18.3 | 26 | 0.5% |
| 14.6 | 26 | 0.5% |
| 23.9 | 25 | 0.5% |
| 15.7 | 25 | 0.5% |
| 20.6 | 24 | 0.5% |
| 21.2 | 24 | 0.5% |
| 22.1 | 24 | 0.5% |
| 23.6 | 24 | 0.5% |
| 21.4 | 24 | 0.5% |
| Other values (588) | 4748 |
| Value | Count | Frequency (%) |
| -13.5 | 1 | |
| -11.7 | 1 | |
| -11.5 | 1 | |
| -11.3 | 1 | |
| -10.4 | 1 | |
| -9.4 | 1 | |
| -9.1 | 1 | |
| -9 | 1 | |
| -8.5 | 1 | |
| -8.2 | 1 |
| Value | Count | Frequency (%) |
| 96.4 | 1 | |
| 88 | 1 | |
| 86.1 | 1 | |
| 81.8 | 1 | |
| 80.3 | 1 | |
| 79.4 | 1 | |
| 77.5 | 1 | |
| 76.7 | 1 | |
| 75.9 | 1 | |
| 75.7 | 1 |
SO2
Real number (ℝ)
Zeros 
| Distinct | 360 |
|---|---|
| Distinct (%) | 7.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.14106 |
| Minimum | 0 |
|---|---|
| Maximum | 41.7 |
| Zeros | 169 |
| Zeros (%) | 3.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1.6 |
| Q1 | 9.9 |
| median | 15.1 |
| Q3 | 20.4 |
| 95-th percentile | 27.7 |
| Maximum | 41.7 |
| Range | 41.7 |
| Interquartile range (IQR) | 10.5 |
Descriptive statistics
| Standard deviation | 7.6684659 |
|---|---|
| Coefficient of variation (CV) | 0.50646823 |
| Kurtosis | -0.27597557 |
| Mean | 15.14106 |
| Median Absolute Deviation (MAD) | 5.3 |
| Skewness | 0.094924674 |
| Sum | 75705.3 |
| Variance | 58.805369 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 169 | 3.4% |
| 14 | 35 | 0.7% |
| 16.2 | 33 | 0.7% |
| 11.5 | 32 | 0.6% |
| 11.1 | 31 | 0.6% |
| 11.6 | 31 | 0.6% |
| 16.6 | 31 | 0.6% |
| 14.9 | 31 | 0.6% |
| 12.2 | 31 | 0.6% |
| 14.3 | 31 | 0.6% |
| Other values (350) | 4545 |
| Value | Count | Frequency (%) |
| 0 | 169 | |
| 0.1 | 1 | < 0.1% |
| 0.2 | 7 | 0.1% |
| 0.3 | 5 | 0.1% |
| 0.4 | 4 | 0.1% |
| 0.5 | 5 | 0.1% |
| 0.6 | 5 | 0.1% |
| 0.7 | 4 | 0.1% |
| 0.8 | 6 | 0.1% |
| 0.9 | 7 | 0.1% |
| Value | Count | Frequency (%) |
| 41.7 | 1 | < 0.1% |
| 41 | 1 | < 0.1% |
| 39.8 | 1 | < 0.1% |
| 39.3 | 1 | < 0.1% |
| 38.9 | 1 | < 0.1% |
| 38.6 | 1 | < 0.1% |
| 38.4 | 3 | |
| 38.3 | 1 | < 0.1% |
| 38.2 | 1 | < 0.1% |
| 38 | 1 | < 0.1% |
CO
Real number (ℝ)
| Distinct | 186 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.904314 |
| Minimum | -0.08 |
|---|---|
| Maximum | 2.14 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 8 |
| Negative (%) | 0.2% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | -0.08 |
|---|---|
| 5-th percentile | 0.42 |
| Q1 | 0.7 |
| median | 0.905 |
| Q3 | 1.1 |
| 95-th percentile | 1.39 |
| Maximum | 2.14 |
| Range | 2.22 |
| Interquartile range (IQR) | 0.4 |
Descriptive statistics
| Standard deviation | 0.29784014 |
|---|---|
| Coefficient of variation (CV) | 0.32935479 |
| Kurtosis | 0.019793885 |
| Mean | 0.904314 |
| Median Absolute Deviation (MAD) | 0.205 |
| Skewness | -0.011361633 |
| Sum | 4521.57 |
| Variance | 0.088708751 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.81 | 86 | 1.7% |
| 0.88 | 79 | 1.6% |
| 1.09 | 79 | 1.6% |
| 0.84 | 76 | 1.5% |
| 1.01 | 75 | 1.5% |
| 0.93 | 75 | 1.5% |
| 0.87 | 73 | 1.5% |
| 1 | 72 | 1.4% |
| 0.86 | 71 | 1.4% |
| 0.95 | 69 | 1.4% |
| Other values (176) | 4245 |
| Value | Count | Frequency (%) |
| -0.08 | 1 | |
| -0.07 | 2 | |
| -0.05 | 2 | |
| -0.03 | 2 | |
| -0.02 | 1 | |
| 0 | 1 | |
| 0.01 | 1 | |
| 0.02 | 1 | |
| 0.03 | 1 | |
| 0.05 | 1 |
| Value | Count | Frequency (%) |
| 2.14 | 1 | |
| 1.9 | 1 | |
| 1.87 | 1 | |
| 1.82 | 1 | |
| 1.81 | 1 | |
| 1.8 | 2 | |
| 1.79 | 1 | |
| 1.78 | 1 | |
| 1.77 | 1 | |
| 1.76 | 1 |
Proximity_to_Industrial_Areas
Real number (ℝ)
Zeros 
| Distinct | 263 |
|---|---|
| Distinct (%) | 5.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.03188 |
| Minimum | 0 |
|---|---|
| Maximum | 46.3 |
| Zeros | 60 |
| Zeros (%) | 1.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.295 |
| Q1 | 1.5 |
| median | 3.5 |
| Q3 | 6.9 |
| 95-th percentile | 15.105 |
| Maximum | 46.3 |
| Range | 46.3 |
| Interquartile range (IQR) | 5.4 |
Descriptive statistics
| Standard deviation | 5.0103521 |
|---|---|
| Coefficient of variation (CV) | 0.9957217 |
| Kurtosis | 5.6367404 |
| Mean | 5.03188 |
| Median Absolute Deviation (MAD) | 2.4 |
| Skewness | 1.9826731 |
| Sum | 25159.4 |
| Variance | 25.103628 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.2 | 98 | 2.0% |
| 0.3 | 97 | 1.9% |
| 0.1 | 92 | 1.8% |
| 0.5 | 89 | 1.8% |
| 0.7 | 89 | 1.8% |
| 0.8 | 87 | 1.7% |
| 0.4 | 83 | 1.7% |
| 0.6 | 82 | 1.6% |
| 1 | 82 | 1.6% |
| 1.6 | 78 | 1.6% |
| Other values (253) | 4123 |
| Value | Count | Frequency (%) |
| 0 | 60 | |
| 0.1 | 92 | |
| 0.2 | 98 | |
| 0.3 | 97 | |
| 0.4 | 83 | |
| 0.5 | 89 | |
| 0.6 | 82 | |
| 0.7 | 89 | |
| 0.8 | 87 | |
| 0.9 | 67 |
| Value | Count | Frequency (%) |
| 46.3 | 1 | |
| 39.7 | 1 | |
| 38.5 | 1 | |
| 38 | 1 | |
| 35.5 | 1 | |
| 34.4 | 1 | |
| 33.2 | 1 | |
| 31.4 | 1 | |
| 29.8 | 1 | |
| 28.7 | 1 |
Population_Density
Real number (ℝ)
| Distinct | 108 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 299.9482 |
| Minimum | 243 |
|---|---|
| Maximum | 358 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 39.2 KiB |
Quantile statistics
| Minimum | 243 |
|---|---|
| 5-th percentile | 272 |
| Q1 | 288 |
| median | 300 |
| Q3 | 311 |
| 95-th percentile | 329 |
| Maximum | 358 |
| Range | 115 |
| Interquartile range (IQR) | 23 |
Descriptive statistics
| Standard deviation | 17.215133 |
|---|---|
| Coefficient of variation (CV) | 0.057393685 |
| Kurtosis | -0.099207536 |
| Mean | 299.9482 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 0.056287035 |
| Sum | 1499741 |
| Variance | 296.36079 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 307 | 124 | 2.5% |
| 293 | 123 | 2.5% |
| 303 | 121 | 2.4% |
| 294 | 121 | 2.4% |
| 296 | 118 | 2.4% |
| 291 | 118 | 2.4% |
| 305 | 116 | 2.3% |
| 302 | 115 | 2.3% |
| 309 | 111 | 2.2% |
| 298 | 109 | 2.2% |
| Other values (98) | 3824 |
| Value | Count | Frequency (%) |
| 243 | 1 | < 0.1% |
| 247 | 1 | < 0.1% |
| 249 | 1 | < 0.1% |
| 250 | 2 | |
| 251 | 2 | |
| 252 | 1 | < 0.1% |
| 253 | 2 | |
| 254 | 2 | |
| 255 | 2 | |
| 256 | 4 |
| Value | Count | Frequency (%) |
| 358 | 1 | < 0.1% |
| 357 | 3 | |
| 356 | 1 | < 0.1% |
| 355 | 1 | < 0.1% |
| 353 | 1 | < 0.1% |
| 352 | 3 | |
| 350 | 1 | < 0.1% |
| 349 | 2 | |
| 346 | 3 | |
| 345 | 3 |
Air Quality
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 306.3 KiB |
| Good | |
|---|---|
| Moderate | |
| Poor | |
| Hazardous |
Length
| Max length | 9 |
|---|---|
| Median length | 4 |
| Mean length | 5.7 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Hazardous |
|---|---|
| 2nd row | Good |
| 3rd row | Good |
| 4th row | Poor |
| 5th row | Poor |
Common Values
| Value | Count | Frequency (%) |
| Good | 2000 | |
| Moderate | 1500 | |
| Poor | 1000 | |
| Hazardous | 500 | 10.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| good | 2000 | |
| moderate | 1500 | |
| poor | 1000 | |
| hazardous | 500 | 10.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 8000 | |
| d | 4000 | |
| e | 3000 | 10.5% |
| r | 3000 | 10.5% |
| a | 2500 | 8.8% |
| G | 2000 | 7.0% |
| M | 1500 | 5.3% |
| t | 1500 | 5.3% |
| P | 1000 | 3.5% |
| H | 500 | 1.8% |
| Other values (3) | 1500 | 5.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 23500 | |
| Uppercase Letter | 5000 | 17.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 8000 | |
| d | 4000 | |
| e | 3000 | 12.8% |
| r | 3000 | 12.8% |
| a | 2500 | 10.6% |
| t | 1500 | 6.4% |
| z | 500 | 2.1% |
| u | 500 | 2.1% |
| s | 500 | 2.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 2000 | |
| M | 1500 | |
| P | 1000 | |
| H | 500 | 10.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 28500 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 8000 | |
| d | 4000 | |
| e | 3000 | 10.5% |
| r | 3000 | 10.5% |
| a | 2500 | 8.8% |
| G | 2000 | 7.0% |
| M | 1500 | 5.3% |
| t | 1500 | 5.3% |
| P | 1000 | 3.5% |
| H | 500 | 1.8% |
| Other values (3) | 1500 | 5.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 28500 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 8000 | |
| d | 4000 | |
| e | 3000 | 10.5% |
| r | 3000 | 10.5% |
| a | 2500 | 8.8% |
| G | 2000 | 7.0% |
| M | 1500 | 5.3% |
| t | 1500 | 5.3% |
| P | 1000 | 3.5% |
| H | 500 | 1.8% |
| Other values (3) | 1500 | 5.3% |
Interactions
Correlations
| Air Quality | CO | Humidity | NO2 | PM10 | PM2.5 | Population_Density | Proximity_to_Industrial_Areas | SO2 | Temperature | |
|---|---|---|---|---|---|---|---|---|---|---|
| Air Quality | 1.000 | 0.014 | 0.000 | 0.009 | 0.000 | 0.000 | 0.000 | 0.000 | 0.006 | 0.004 |
| CO | 0.014 | 1.000 | -0.018 | 0.013 | -0.003 | 0.005 | -0.015 | -0.024 | 0.014 | 0.018 |
| Humidity | 0.000 | -0.018 | 1.000 | 0.003 | -0.004 | -0.010 | -0.010 | 0.002 | -0.023 | -0.017 |
| NO2 | 0.009 | 0.013 | 0.003 | 1.000 | -0.003 | 0.000 | -0.028 | -0.003 | 0.005 | -0.022 |
| PM10 | 0.000 | -0.003 | -0.004 | -0.003 | 1.000 | 0.960 | -0.006 | -0.023 | -0.032 | -0.010 |
| PM2.5 | 0.000 | 0.005 | -0.010 | 0.000 | 0.960 | 1.000 | -0.006 | -0.020 | -0.026 | -0.001 |
| Population_Density | 0.000 | -0.015 | -0.010 | -0.028 | -0.006 | -0.006 | 1.000 | -0.010 | 0.001 | 0.017 |
| Proximity_to_Industrial_Areas | 0.000 | -0.024 | 0.002 | -0.003 | -0.023 | -0.020 | -0.010 | 1.000 | -0.010 | 0.001 |
| SO2 | 0.006 | 0.014 | -0.023 | 0.005 | -0.032 | -0.026 | 0.001 | -0.010 | 1.000 | 0.013 |
| Temperature | 0.004 | 0.018 | -0.017 | -0.022 | -0.010 | -0.001 | 0.017 | 0.001 | 0.013 | 1.000 |
Missing values
Sample
| Temperature | Humidity | PM2.5 | PM10 | NO2 | SO2 | CO | Proximity_to_Industrial_Areas | Population_Density | Air Quality | |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 27.2 | 51.7 | 35.1 | 46.2 | 26.7 | 32.2 | 0.98 | 11.2 | 314 | Hazardous |
| 1 | 26.3 | 59.3 | 1.0 | 6.2 | 38.3 | 20.4 | 0.68 | 13.5 | 298 | Good |
| 2 | 27.9 | 73.2 | 20.0 | 39.4 | 19.6 | 5.8 | 0.95 | 5.4 | 309 | Good |
| 3 | 23.9 | 51.9 | 14.7 | 24.3 | 5.2 | 12.6 | 1.24 | 4.5 | 282 | Poor |
| 4 | 25.2 | 59.0 | 26.3 | 30.9 | 26.8 | 13.5 | 1.06 | 5.6 | 293 | Poor |
| 5 | 22.7 | 61.4 | 4.3 | 3.2 | 11.1 | 21.0 | 0.59 | 5.6 | 290 | Good |
| 6 | 31.2 | 67.9 | 49.6 | 62.4 | 26.2 | 14.3 | 1.47 | 4.0 | 313 | Good |
| 7 | 25.1 | 56.9 | 36.8 | 58.7 | 20.4 | 20.4 | 0.82 | 0.4 | 300 | Poor |
| 8 | 35.7 | 75.4 | 101.7 | 115.4 | 36.9 | 2.1 | 1.03 | 2.4 | 295 | Moderate |
| 9 | 24.3 | 61.6 | 0.2 | 15.1 | 26.1 | 17.0 | 0.64 | 1.7 | 311 | Moderate |
| Temperature | Humidity | PM2.5 | PM10 | NO2 | SO2 | CO | Proximity_to_Industrial_Areas | Population_Density | Air Quality | |
|---|---|---|---|---|---|---|---|---|---|---|
| 4990 | 28.5 | 45.6 | 8.9 | 23.9 | 7.7 | 6.2 | 0.50 | 0.4 | 301 | Good |
| 4991 | 32.6 | 77.5 | 23.3 | 32.3 | 13.3 | 11.8 | 1.17 | 8.5 | 311 | Good |
| 4992 | 18.5 | 87.1 | 23.8 | 31.6 | 23.1 | 8.0 | 1.02 | 6.6 | 275 | Good |
| 4993 | 22.5 | 63.2 | 100.4 | 118.1 | 33.4 | 7.9 | 0.53 | 1.6 | 310 | Poor |
| 4994 | 27.2 | 50.6 | 65.5 | 73.5 | 11.5 | 24.9 | 0.96 | 6.0 | 285 | Good |
| 4995 | 29.3 | 36.8 | 80.3 | 90.9 | 9.2 | 14.1 | 0.97 | 10.2 | 287 | Moderate |
| 4996 | 15.7 | 51.7 | 0.7 | 11.4 | 40.5 | 13.8 | 1.07 | 4.2 | 320 | Good |
| 4997 | 27.8 | 48.1 | 8.9 | 16.4 | 8.6 | 17.7 | 0.54 | 0.3 | 302 | Moderate |
| 4998 | 30.4 | 50.4 | 2.2 | 18.8 | 13.1 | 22.3 | 0.94 | 6.7 | 308 | Moderate |
| 4999 | 21.5 | 76.5 | 45.0 | 58.0 | 37.9 | 0.0 | 0.96 | 0.2 | 290 | Hazardous |